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In this paper we extend the Gaussian self-consistent method to permit study of the equilibrium 
and kinetics of conformational transitions for heteropolymers with any given primary sequence. 
The kinetic equations earlier derived by us are transformed to a form containing only the mean 
squared distances between pairs of monomers. These equations are further expressed in terms of 
instantaneous gradients of the variational free energy. The method allowed us to study exhaustively 
the stability and conformational structure of some periodic and random aperiodic sequences. A 
typical phase diagram of a fairly long amphiphilic heteropolymer chain is found to contain phases of 
the extended coil, the homogeneous globule, the micro-phase separated globule, and a large number 
of frustrated states, which result in conformational phases of the random coil and the frozen globule. 
We have also found that for a certain class of sequences the frustrated phases are suppressed. The 
kinetics of folding from the extended coil to the globule proceeds through non-equilibrium states 
possessing locally compacted, but partially misfolded and frustrated, structure. This results in a 
rather complicated multistep kinetic process typical of glassy systems. 
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PACS numbers: 36.20.-r, 87.15, 



. I. INTRODUCTION 



Study of the conformational transitions of heteropolymers in dilute solutions is important for many applications 
from the chemical industry to biotechnology. Directed more towards the former, there has been a significant amount 
of theoretical work carried out on concentrated copolymer solutions, mixtures and blends using various types 

of the density formalism. However, these approaches are not valid at infinitely low dilution where the fundamental 
interactions of the individual macromolecule determine its conformational state. This situation is more relevant for 
biochemistry. The problem is even harder to address at non-equilibrium conditions typical for biopolymers in vivo 

CM. @- 

For these reasons we have been working for some time on developing an adequate statistical mechanical technique 
for studying the equilibrium structure and kinetics across phase transitions in heteropolymers [|Tl|-|l3| . The main idea 
was to extend the Gaussian self-consistent (GSC) method, originally proposed for the homopolymer (see e. g. |lif| and 
references therein), to the case of inhomogeneous monomer interactions. This has been achieved in Ref. |ll] where 
we have derived the complete set of non -linear kinetic equations for complex valued equal-time correlation functions 
of the Fourier transforms of the monomer coordinates. There we have analysed in some detail the simplest periodic 
(ab) copolymer, but study of more complicated sequences remained out of our practical computational reach. The 
relation of the kinetic equations to the equilibrium free energy, as well as the expression for the entropy, were also 
unknown to us at that stage. Thus, the phase diagram have not been elucidated and certain other important issues 
have not been addressed in that work. 

In the following two works Jl2|,[l3| we have achieved further progress and resolved most of these questions, though 
, at the cost of loosing detailed information about individual sequences. Namely, we have performed averaging of the 
GSC equations over the quenched disorder in the monomer amphiphilic strength. This yielded a closed set of kinetic 
equations for description of random copolymers with a Gaussian distribution of the disorder. Such an idealised system 
is very interesting in itself, particularly since there is a hope that its study could shed light on some features of the 
extremely complex problem of protein folding flfjfl . 

To render the quenched disorder problem more tractable certain perturbative approximation was necessary |r2[ . It 



became clear in Ref. |13 that this causes some deficiencies in the equilibrium limit of the formalism, and we have 



alluded as to how these could be alleviated in higher orders of the expansion. 
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Therefore, it seems important to revisit the general formalism of Ref. in order to resolve remaining difficulties, 
maintaining the rich information about particular sequences and avoiding any further approximations. Furthermore, 
there exists, till now, no simple theoretical procedure capable of giving the equilibrium conformational states of a 
heteropolymer with arbitrary sequence, that can in a consistent manner also give the full kinetic pathway between 
these states. The work presented here achieves this. To demonstrate the strength of the extended GSC method 
we consider a number of interesting examples of heteropolymer sequences. Different kinds of interactions, such the 
hydrodynamic interaction, may be straightforwardly incorporated into the scheme. A rich and nontrivial physical 
picture emerges as a result of this theoretical progress. 



II. THE BEAD-AND-SPRING MODEL 

Traditionally, we proceed from the coarse-grained description of the polymer chain with the spatial coordinates 
X„ ascribed to the n-th monomer. It is assumed that the long timescale evolution of conformational changes is well 
represented by the phenomenological Langevin equation, which upon neglecting the backflow effect of the solvent may 
be written as, 

„ d _, dH 

where £b is the "bare" friction constant per monomer. For the discussion of the hydrodynamic interaction we refer 
the reader to Appendix [a|. The thermal fluctuations are incorporated via the Gaussian noise which, according to the 
Einstein law, is characterised by the second momentum, 

<<V)> = 2k B TC b 6 a ><*'6 n , n ,6(t - 0, (2) 

where the Greek indices denote the spatial components of 3-d vectors. 

In the current treatment the solvent is effectively excluded from the consideration |15j and the resulting monomer 
interactions are described by the effective free energy functional, 

, oo J— 1 

h = - ]T(x„ - x,^) 2 + 1 ]T(x„ +1 + Vi - 2x„) 2 + E u w II *( x ^ +1 - x «i)> ( 3 ) 

n n J=2 {n} i=l 

where in principle nr, are allowed to have any dependence on the site indices {n} = {ni, . . . , nj}. 

The first two terms in Eq. (|^) describe the connectivity and the stiffness of the chain. Their coefficients have 
the following simple meaning: k = k B T /I 2 and ae = ksTX/l 3 with I and A called the statistical segment length and 
the persistent length |i|,[l(| respectively. Apart from these interactions, local along the chain, there are also long- 
range volume interactions represented by the virial-type expansion in Eq. (||). The latter reflects the hard core 
repulsion and weak attraction between monomers, but also the effective interaction mediated by the solvent -monomer 
couplings /„. 

The coefficients in this expansion may be calculated as functions of the temperature and the parameters of molecular 
interaction. However, for our purposes we do not need to know their explicit form here as we shall keep only a few 
first terms. Appropriate coefficients then may be viewed as independent phenomenological parameters which could 
be directly related to experimentally measurable quantities. 

In Refs. @JI^1 we have discussed that the case of amphiphilic heteropolymers, for which monomers differ only in the 
monomer-solvent coupling constants, corresponds to the following choice of site-dependent second virial coefficients 
in Eq. ©, 

^ = « (2) + i(/„ +/„o, = ( 4 ) 

n 

The mean second virial coefficient vS 2 ^ is associated with the quality of the solvent: positive values correspond to the 
good solvent (where effective two-body repulsion of monomers results in the extended coil conformation), and the 
negative values correspond to the bad solvent condition (where the effective two-body attraction tends to compact 
the chain). 

The set of couplings {I n } expresses the chemical composition, or using the biological terminology, the primary 
sequence of a heteropolymer. For simplicity, in the consequent sections we shall consider examples for which these 
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constants can be parametrised in a simple way: /„ = Aer„, where variables a n can only take three values: —1,1 
or corresponding to (a) hydrophobic , (b) hydrophilic and (c) "neutral" monomers respectively. The parameter A 
is called the degree of amphiphilicity of the chain. Note that for more complicated than binary sequences there is 
another relevant dispersion, (A^) 2 = jr ">^ m cr 2 ^, and the combination AA„ is a more appropriate variable. 



III. THE GSC KINETIC EQUATIONS 

In work jllj we have derived a set (18) of the GSC kinetic equations for the equal-time correlation functions of 
the Fourier transforms of monomer coordinates T^ A (t). There, at the end of Sec. II, we have mentioned that a 
polymer with no periodic structure may be described by choosing the number of blocks M — 1. The equations for 
the correlation functions, 

T mw! (t) = Vx ro (t) X m , (t) \ , (5) 

contain some redundancies and also are intermixed with the diffusive degree of freedom describing the motion of the 
centre-of-mass. Obviously for a single chain the latter can be easily decoupled from the intra-molecular degrees of 
freedom by introducing the mean squared distances between pairs of monomers, 

Drnm'it) — ~Z \ (^m (0 -^-m'(^)) / ^~ mm ~t~ m'm' 2.7** mm' • (6) 



It turns out that from Eq. (18) in Ref. |11| after certain algebraic manipulations one can obtain a closed set of 
equations for the quantities D mm t{t). 

Taking into account the additional bending energy contribution in Eq. ([|), the GSC equations may be written 
down in the most general form as follows, 

~^2~^^ rnrn ' — 2/c^T(l $mm') -\~ k(D mm / ,m+lm ~\~ ^mm' ,m — \m ~t~ {jTi < ► Til )) 

c&(Dmm' ,rn-\-2 m+1 ~t~ ^rnm' ,rn — 2 m— 1 ^^mm' ^m — \m 3-D mm ' ,771+1 m ~t~ {jYl Tfl )) 
00 fi(J) J-l 

+ \^X~^ inl V' A (J ~ 1 V^ m 4- <->'"' — S m — S m ')D , (7) 

^ 2^, (A et /±(J-l))5/2 V ^°ni +0 n (+ i °n i+1 °m )^mm' ,nm j+ i , (< ) 

J=2 {n} V ' i,j=l 



where we have introduced the four-point correlation functions, D mm >, nn >, and the matrix A^ 7 ^ of size (J — 1) with 
the cofactor A,-' 7 1 , 

Emm' ,nn' — ~^J^mn ~l~ -^m'n' ^mn' -^m'n); (8) 

A^ = D nini+li „ inj+1 . (9) 

Similarly to Eq. (31) in Ref. |ll|] for the mean energy £ = (H) we have, 
3^. ^ 3gg ^ 

& ^ ^ D n n _ X,nn— 1 ~t~ ^ n,n+l n ~t~ n,n—ln n+l,n— ln+l) 

n n 
oo 

+EE"w( detA(J_1) )" 3/2 - ( 10 ) 

J=2 {„} 

It is interesting to note that in the case of the homopolymer the right-hand side of the kinetic equations (0) may be 
rewritten via the instantaneous gradients of the variational free energy by introducing the normal modes (see Eq. (5) 
in Ref. JlTj]). This establishes connection of the stationary limit of our kinetic approach with the equilibrium theory, 
in which we recover the Gibbs-Bogoliubov variational principle. We also note that if first order phase transitions are 
involved, one has to possess the expression for the free energy in order to determine the phase boundaries by finding 
the global free energy minimum. 

The variational free energy, A, based on the Gaussian Ansatz for equal-time pair correlation functions, contains 
two terms, A = £ — TS. Naturally, the mean energy term is given by Eq. (110). The second, entropic contribution, is 
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calculated in Appendix [BL Let us summarise the result of that calculation here. One representation for the entropy 
that is suitable for numerical analysis is, 

S =^k B logdetR ( - N - 1 \ (11) 
where we have used the determinant of the (N — l)-dimensional major submatrix of the matrix, 

Rnn' — J^2 ^nm.tiV = — * ^ * ^ '> 

mm' 

and the matrix D obviously has the elements equal to D nn i. The reason for appearance of the truncated matrix is 
that we have excluded the zero eigenvalue of R related to the translational invariance. Here we have also introduced 
the (N — l)-dimensional orthogonal projector A such that, 

(A°) 2 = A°, (A°) T = Ao, $>V = 0, (13) 

n 

with the matrix elements (A°)„fc = S n ^ — 1/N. This matrix has obviously one zero eigenvalue and N — 1 degenerate 
eigenvalues equal to 1. 

However, for analytical treatment it is more convenient to obtain a slightly different expression by regularising the 



zero eigenvalue (i.e. by imposing the constraint J2 n = as a "soft" condition in Eq. (Bl)), 



S= 3 -k B lunlo g d ^ R + el \ (14) 



Since the matrix (R + el) is nondegenerate we can easily differentiate Eq. ( |l4] ) so that, 

fl / BR 



Q lk = lim J2 D ^d^- TT 1 °Z( R + el ) = E D v Tt ° ( qd~. * V ) = ( A % . ( 15 ) 



where inside the trace Tro over the (N — l)-dimensional subspace projected out by A the matrix R becomes invertible 
with the inverse denoted as . This allows us to obtain the combination which appears in the kinetic equations, 

Qu + Qkk — Qik — Qki — 2(1 — 5i,k)- (16) 

These preliminaries are sufficient to prove the desired relation. Indeed, using Eq. ( |l6| ) by direct and tedious 
differentiation of Eq. ( ^0| ) one can show that the kinetic equations (Q) may be expressed through the instantaneous 
gradients jl8| of the variational free energy as, 

££JW*) = -~£(A^) - D m , m „(t)) (? Am \ - ° Am l) ■ (17) 
2dt mmW ™ U K " \dD mm „(t) dD m , m „(t) y ' 

m" x ' 

This formula, together with Eqs. (|Io| , |l4] ), is the key formal result of the current work. The structure of Eq. ( |l7| ) is 
sufficiently nontrivial to be guessed from phenomenological arguments and has been derived in a systematic manner 
proceeding from Eq. (^). 

We would like to comment here that although, for simplicity, we have presented the explicit formulae above for 
a ring polymer, our current formalism is general and covariant. In fact, the kinetic equations (]l7]) are valid for any 
topology of the chain. Thus, it is straightforward to consider more complicated topologies such as a star, brush, 
network, branched chain and so on. For that it is sufficient to modify only the spring and stiffness terms in Eq. 
(H), and respectively in Eq. (|lo|). For example, to describe an open chain one has simply to suppress the term with 
ii = in the connectivity contribution, and the terms with n = 0, N — 1 in the stiffness contribution to the energy. 
We undertake a detailed comparison of ring versus open homopolymers in kinetics at the collapse transition of the 
homopolymer in a separate work |l9| . Another interesting possibility is that the general equation (|l7|) is also valid for 
models with different ways of representing the connectivity and stiffness. For example, the freely rotating model Q| 
can be obtained by suppressing the first two terms in Eq. (|ITJ) and instead keeping fixed the following mean squared 
distances: -D m ,m+i = &o and D m m+ 2 = 46Qsin 2 9/2, where bo and 6 are the bond length and angle. This is easy to 
prove by adding appropriate "soft" constraints to the free energy functional and taking the consequent limit in Eq. 
(|l7|), so that they become delta-functions in the partition function. 
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Finally, let us introduce two main observables: the mean squared radius of gyration and the degree of micro-phase 
separation fl~2]| , 

R l = ^pY, D mm >, * = N2 R ] AAa E ("w - u^)D mm ,. (18) 

mm' 9 mm' 

The second parameter has a meaning of the dimensionless correlation of the matrices of the relative two-body 
interaction and the mean squared distances. For heteropolymers with two types of monomers it characterises the 
difference between the mean squared radii of gyration of hydrophilic and hydrophobic species, and for the symmetric 
composition with their numbers equal we have a simple relation: ^ = (R 2 (b) — R 2 (a))/(2R 2 ). 

IV. NUMERICAL RESULTS 



The self-consistent kinetic equations ( JT7| ) have been studied numerically using the explicit formulae 
the effective potentials (see Appendix |c]) , and the expression given by the first term in the right-hand side of Eq. (|7|) 
for the entropic contribution. We used the fifth order adaptive step Runge-Kutta method |Q to improve stability of 
the solution which, for large amphiphilicity parameter, A, is characterised by a rather rugged free energy landscape. 
We note that such kinetic method for finding the equilibrium distributions is more reliable and efficient than the 
standard methods of free energy minimisation if there are lots of mountains and valleys on its surface. We also refrain 
from study of the influence of the hydrodynamics in this paper for the analysis and results are complicated enough 
already. Besides, the hydrodynamics in the preaveraged approximation does not affect the equilibrium state, which 
is recovered by taking the stationary limit in the kinetic equations. 

We include the volume interactions up to the three-body terms only, i.e. = for J > 3. As can be seen from 

) the computational time per time step scales with the chain length as t c ~ N 3 here. This performance 
is intermediate between that of the homopolymer t c ~ N 2 [jl^JlTj and that of the random copolymer t c ~ TV 4 jl2],[l3| . 
The performance of the formalism in Ref. jTl| was t c ~ A" 4 M^where K and M are the block length and number of 
blocks respectively. Besides, that formalism relied on the use of complex variables, and the unitary transformation to 
the real basis was not an easily automatised task for complicated sequences. Moreover, the treatment of the diffusive 
mode was nontrivial and sequence dependent. Thus, in every respect, the current scheme is most attractive for study 
of heteropolymer sequences from the computational point of view. 

It is natural to work here with combinations C = (fc^T '/k) 1 / 2 and T = C,b/k as the units of size and time. We choose 
k = 1, ksT = 1 and £& = 1 to fix C and T equal to unity. In addition, we fix the following interaction parameters: 
the third virial coefficients u^ m , m „ = 10 ksTC 6 and the stiffness ae = as we did in Ref. pT[ . 

Now, we turn to discussion of concrete results. We have studied ring chains of the lengths N = 30, 60, 90. We shall 
present here the most complete results for N = 60 and discuss the N dependence only briefly. We have examined 
many different sequences. However, we present our data in this paper only for three particular choices which we found 
most typical and illustrative of the heteropolymer behaviour. These are chains of: 30 (ab) blocks, 10 (aaabbb) blocks 
and 2 (abacbbcabccbcaaacbcccbbaacbcca) blocks, which we call the "short" blocks, "long" blocks and "random" 
sequences respectively (see the end of Section || for the monomer notations). 

A. Equilibrium phase diagrams 

The phase diagrams in terms of the mean second virial coefficient ui and the amphiphilicity A for the above 
mentioned three sequences are presented in Figs. For positive values of u' 2 ' and comparatively small values of 
A the conformational state of the chain is akin to a homopolymer extended coil (see Fig. [|a). By decreasing to 
the negative region the chain is caused to undergo a continuous (second-order-like) collapse transition (curve (C)), 
that is characterised by a rapid fall of the radius of gyration Q (see Fig. 1 of Ref. |Q) and the change of the fractal 
dimension. Proceeding from the collapsed globule at a fixed negative vS 2 ^ the increase of the amphiphilicity A also 
causes a continuous transition (curve (S)) to the micro-phase separated (MPS) globule |^2[| . During this transition the 
system size, R g , monotonically increases (see Fig. |J) and the mean energy E decreases in agreement with our earlier 
results in Refs. [ p3|JTT| . The former change is more pronounced for the "long" blocks compared to other sequences, for 
which the connectivity constraints impede formation of structures with a hydrophilic shell and hydrophobic core. The 
MPS order parameter ^> (see Fig. ^J) increases almost linearly for small A, then after the transition asymptotically 
saturates. 
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The transition from the coil at large values of A to the MPS globule turns out to be more complicated, and 
essentially dependent on the sequence. In case of the "long" blocks (Fig. ||) the collapse transition to the MPS globule 
becomes discontinuous (first-order-like) . Thus, inside the boundaries of metastability, designated by spinodals I' 
and I" , there are two competing minima of the free energy: the coil and the MPS globule. The former minimum is 
characterised by a large size, R g , (on the right of Fig. fjj) and a small MPS order parameter 'J (Fig. |J), while for the 
latter minimum the situation is reversed (see the left hand side of Figs. @J|). The depths of the free energy minima 
become exactly equal on the transition curve I in Fig. |^. 

For the "short" blocks (Fig. |l|), as well as for many random sequences (such as in Fig. ||), the phase diagram is 
much more complicated. Starting from some value of A and for intermediate values of u^ 2 ' there appear additional 
solutions corresponding to local minima of the free energy. The broad region where this could take place is bounded 
by the curves F and II" in Figs. |J|. With increasing A the number of such solutions grows quickly. Significantly, in 
a region of the phase diagram some of these become the main free energy minimum. As the number of such solutions 
grows roughly exponentially with the chain length, we do not attempt to draw all their boundaries of (meta)stability. 
Instead, we shall designate them collectively as frustrated phases, explaining this terminology below. 

An important point here is that, as our analysis shows, these solutions become dominant in a narrow region of the 
phase diagram due to a subtle competition between the mean energy and the entropy. The MPS globule is entropically 
unfavourable there because the overall shrinking force is insufficiently strong. The values of R 2 g and "J are intermediate 
for these solutions and lie between those of the coil and MPS globule (see Figs. p| JlO|^_ In this sense, we can call them 
nonfully compacted and misfolded states. In comparison, the MPS globule (see FigT^d) has a more compact size and 
better optimised volume interactions, what is manifested in a higher value of ^ . 

Most interesting is the local structure of these additional phases. Let us discuss the particular example of the 30(ab) 
sequence. The formalism of Ref. p"l| ] has used heavily the assumption of certain symmetries for the mean squared 
distances D mm i due to which these variables can take only 3M independent values, where M is the number of blocks. 
These symmetries are: the block translation invariance, 

D m+K i, m '+Ki = A™,™', for any m,ra' and i, (19) 

where K is the block length, and the more complicated inversion symmetry discussed in detail in Ref. |[Ll]| . These 
symmetries have a simple meaning — a renumbering of monomers does change the average properties over the 
statistical ensemble — the interactions of a ring chain remain the same. However, the maximal possible number of 
dynamical variables in the GSC method is much larger and equals to N(N — l)/2. Surprisingly, it turns out that 
the frustrated phases are characterised by spontaneous breaking of these symmetries, and therefore only the current 
version of the method that takes into account all degrees of freedom can describe them. Thus, the property fll9| ) is no 
longer valid. This phenomenon describes formation of local frustrated heterogeneities (see Figs. ||b,c) in which pieces 
of the chain form MPS clusters [^3) that are prevented from further coalescing by their hydrophilic shells and high 
entropic barriers. 

The role of spontaneous symmetry breaking is well recognised in equilibrium statistical mechanics. What is striking 
here is that the number of distinct spontaneously broken states becomes huge for large system sizes. This diversity 
and a special foliating structure of various branches leads in the thermodynamic limit to what is known as a spin 
glass like frozen phase |24j of random copolymers (see e.g. Refs. Jl^Jl^l and n umero us references therein). We shall 



return to the issue of spontaneous symmetry breaking in kinetics in subsection IV B . 

Even though the kinematic symmetries are not present from the outset for arbitrary, or random, sequences, the 
structure of the phase diagram (Fig. ^) and behaviour of main observables (Figs. remain very similar. It is the 

particular structure, number and the shape of boundaries of frustrated phases that are very sensitive to the sequence. 
The symmetry that may be broken in this case has a subtler, dynamic, meaning and may be expressed in terms of the 
replica formalism p4| . In a sense, for a very long periodic chain, blocks may be viewed as identical copies of a smaller 
block-length chain. Thus, it is by no means surprising that the replica symmetry breaking in our case for periodic 
systems takes such an explicit manifestation in the breaking of the block translational symmetry. This important 
point was completely missed, nor could it be discovered, in our considerations of Ref. [TT| . Therefore we have achieved 
here a new significant insight into the problem by a simple extension of the GSC method. 

An interesting feature of our phase diagrams is that the region between spinodals I' and II" , designating where 
the frustrated phases can exist, expands dramatically with increasing chain length. For example, at A = 25, for the 
15(ab) sequence it lies approximately between —19 < u' 2 ' < 10, while for the 30(ab) — between —39 < v> 2 > < 11, and 
the curve II" goes nearly vertically downwards for larger N. According to the above interpretation, for infinitely long 
chain somewhere in this broad region and close to its boundaries there are the actual glass transition curves, which 
should be determined using proper glass order parameters. Thus, for short chains the curves I' and II" may be viewed 
as approximate indicators of the freezing transitions. The former distinguishes between the homopolymer-like and 
the random coil, while the latter — between the homopolymer-like (liquid) and the frozen globules. As for the region 



G 



of stable MPS globule, it gets relatively smaller for larger systems. That is not surprising — the frustrated phases 
expand and the phase separation involving larger spatial scales requires stronger interactions. Having understood the 
identifications for the spinodals, we now can recognise in Figs. U0 the main features of the phase diagram, though 
rather distorted, of random copolymer model presented in Ref. 13T . 

Finally, let us comment on the phase diagram of the "long" blocks (Fig. §. The micro-phase separation is obviously 
easier in this case and it dominates for large values of A, so that the frustrated phases are suppressed. We found 
that, qualitatively, in order to form a frustrated phase, in a finite range of A values, the number of (not necessarily 
identical) pieces of the chain with competing interactions should be larger than some critical number, in principle, 
weakly dependent on N. Here are a few examples conforming to this qualitative criterion: the phase diagram of 
10(ab) behaves roughly as for lO(aaabbb), but for 15(ab) it behaves similarly to 30(ab); while for 15(aabb) the phase 
diagram looks like for 15(aaabbb) at small and moderate values of A, it becomes as for 30(ab) for much larger values 
of the amphiphilicity. It is reasonable to conjecture therefore that for a large number of (aaabbb) blocks, as well as 
for extremely high values of A and just 10 blocks, the frustrated phases may be found again. 



B. Folding kinetics 



Here we shall consider the time evolution of the conformational state of the system away from its initial equilibrium 
after it has been subjected to an instantaneous temperature jump that causes the two-body interaction parameters 
vS 2 ^ and A to change. We are interested in quenches from the homopolymer coil, where all monomers are equally 
hydrophilic (u^ > and A = 0), to the region of parameters corresponding to the MPS globular state, so that the 
'a' species became strongly hydrophobic and the 'b' species remained weakly hydrophilic (u^ <C and A > |m^|). 

The temporal behaviour of the mean squared radius of gyration, R 2 (t), the MPS parameter, ^(t), and the instanta- 
neous free energy, A(t), in kinetics of folding for different sequences is presented in Figs. nT[-13l For the homopolymer 
(curves (A)) R 2 , and A decrease monotonically to their final equilibrium values, while the MPS parameter, 'F, remains 
identically zero for there is no distinction between different monomer species. These curves agree with the earlier 
results of Ref. H] and serve for reference purpose here. 

Now let us discuss the curves (B) corresponding to the periodic (ab) sequence. In the previous subsection we have 
mentioned that the current formalism yields results consistent with those of Ref. [ pT| ] only beyond the parameter 
region of the frustrated phases, which are characterised by spontaneous breaking of the block symmetry. In kinetics 
the situation is somewhat similar, but the consistence with the previous simplified formalism is even more limited. 
Really, the mean squared radius of gyration remains close to that of the "effective" homopolymer (curve (A)) during 
the first kinetic stage (see Eq. (74) in Ref. pj]|). Interestingly, this is not quite so for more complicated sequences. 
The MPS parameter, 'F(t), for the (ab) copolymer grows slowly in a way similar to the splitting of the Fourier modes 
Fq 1 ^) — J-q°(t) for large indices q (see Fig. 4 in Ref. pi]]). However, contrary to the strange conclusion in Ref. pH 
that the kinetics of the (ab) copolymer proceeds faster than for the homopolymer, we now have a, natural from the 
point of view of Monte Carlo simulations | p3| , slowing down of copolymer kinetics. Analysis of D mm /(t) shows that 
this effect is entirely due to the spontaneous breaking of the block symmetry in kinetics, something that has not been 
accounted for in Ref. fill . Indeed, in Fig. |lj we exhibit the time dependence of the mean squared distances between 
two nearest hydrophobic monomers D2k.2k+2(t) for the (ab) copolymer in kinetics after a quench to the MPS globule 
region. For early, as well as for late, times these functions for different k are exactly equal to each other. However, 
there is well defined intermediate period in time where the block symmetry is spectacularly broken (see Fig. |l^). We 
remark that the symmetry breaks and restores in a step-like manner (e.g. -Do. 2 and join together at t ~ 10, 
earlier than with other functions) and also that this effect is relatively strong. 

We would like to make a general comment here on the nature of spontaneous symmetry breaking in kinetics. Thus, 
normally in such situations at equilibrium there exist a thermodynamically unstable symmetric free energy minimum 
and a disjoint set of symmetry broken minima, which may be transformed to each other by the residual subgroup of 
symmetry transformations. These states may also be obtained kinetically as infinite time limits of the time evolution 
starting from any, for example, the symmetric initial state, which happens to be the main free energy minimum before 
the quench. 

However, the formal structure of the GSC kinetic equations ( p"7j ) is such that they yield a symmetric solution at any 
moment in time provided one proceeds from the symmetric initial condition. A question arises then — how can one 
obtain the kinetics that could lead eventually to the multitude of final states with broken symmetry? The answer is 
clear in the exact theory — there are fluctuations that can transform between different spontaneously broken states 
in kinetics. 

The GSC method presents, though an optimised and improved, but still a mean field type theory, where such 
fluctuations are not properly included. Manifestation of the kinetic spontaneous symmetry breaking takes a different 
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form there. Namely, at some moment in time the symmetric solution of the kinetic equations becomes unstable with 
respect to perturbations (whether of the initial condition, or of the interaction matrix, u ml ,). Thus, for example, one 
can add an infinitesimal symmetry breaking term s nn i to the two-body interaction matrix and consider the limit of 
vanishing perturbation in the solution. Different choices of the form of £ nn > in the unperturbed limit would yield all 
possible types of spontaneously broken kinetic evolution, which are, of course, equivalent to each other. Numerically, 
such a regularisation procedure is not even necessary. There is always an intrinsic perturbation due to computer 
round off and numerical integration errors. Thus, if the symmetry is favourable to be kinetically broken somewhere, 
numerically one obtains one of the spontaneously broken solutions there, rather than the unstable symmetric solution, 
unless the symmetry conditions have been imposed by hand. In this situation improvement of the numerical precision 
would have no profound effect — the kinetic process either does not change, or can change only up to an experimentally 
unobservable residual symmetry transformation. 

We note also that such a procedure of perturbing the interaction matrix here is analogous to introducing an external 
magnetic field in the Ising model. The spontaneous magnetisation of a macroscopic sample of ferromagnet may be 
achieved by gradually switching off the external magnetic field. In the absence of the field there would remain domains 
with long-range order, but varying directions of the spontaneous magnetisation canceling each other. Unfortunately, 
in our case it is not evident how to experimentally implement an analogous gradual switching off of e nn t . 

Now returning to our results, in Fig. [If] one can see that the folding kinetics for the (ab) copolymer is about 3 
times slower than for the homopolymer of the same length, the effect being even stronger for other sequences we 
have considered. From Fig. [l2] and the phase diagrams in Figs, [p-^ it is evident that the considered quench results 
in the final state of the MPS globule for the "long" blocks (curve (C)), whereas in the frustrated phases for two 
other sequences (curves (B) and (D)). For the latter phases the MPS parameter is smaller and the radius is larger 
than for the MPS globule, as we already know from the equilibrium considerations. It is interesting to note that 
the final relaxation to the frustrated phases may be rather unusual. For example, in Figs. for the "random" 

sequence R 2 g (t) increases and ^(t) decreases during the last kinetic stage, what is the converse to the behaviour at 
final relaxations in other cases. Another unusual observation is that for some sequences the parameter may even 
become negative during some time in kinetics, something we never observed at equilibrium. This shows that the 
structure of non-equilibrium conformations can be very complicated. 

The instantaneous free energy A(t) depicted in Fig. [f3| turns out to be the quantity most sensitive to the con- 
formational structure of the non-equilibrium state. From that figure it strikes that, while the homopolymer folding 
is a single smooth relaxational process, the folding of heteropolymers proceeds through a multistep acceleration- 
deceleration process. The flat regions of a staircase-like function correspond to temporary kinetic arrest of the system 
in transient non-equilibrium (mostly symmetry broken) conformations. Generally speaking, in the GSC method we 
deal with the time evolution of a statistical ensemble of various initial conformations. The flat regions appear due to 
transient trappings of various members of the ensemble in their local shallow energy minima. Since such minima are 
encountered at different moments in time for different members of the ensemble, their influence on the overall time 
evolution of averaged observables is manifested in a smooth characteristic slowing down. 

Qualitatively then, summarising the data for various sequences, most of which we have suppressed here, we can 
say that the number of local minima affects the number of stair steps, whereas the depth of minima determines the 
lengths of the steps. This interpretation is highly supported by the strong dependence of the stair case structure on 
the chain length, the sequence and the interaction parameters, for it is known how the frequency and depths of local 
potential energy minima depend on these factors. On the one hand, it is known that for a long heteropolymer chain 
the number of local minima grows exponentially with the chain length TV, and indeed we see that the number of steps 
in A(t) grows just as quickly. On the other hand, increasing of the amphiphilicity, A, leads to higher depths of local 
minima as well as an increase in their number, and really the steps in A(t) become longer and more numerous. Let 
us give another example of such connection. For the "long" blocks the interactions are less frustrated than, say, for 
the (ab) blocks, thus there are much fewer of local energy minima, but the barrier between the coil and MPS globule 
is a higher. As a result, the kinetic process (curve (C) in Fig. 13) proceeds through only one rather long kinetically 
arrested step. 

It is worthwhile to make a comment here on the notion of kinetic stages introduced in earlier works ]25| , p^ , p3[ . 
Those we associated with typical structures of the conformation and accompanying kinetic laws. We distinguished 
at least the following kinetic stages at the collapse of the homopolymer: early time necklace formation, middle time 
coarsening and a number of final relaxational processes. The multistep character of folding observed in this work 
affects the middle kinetic stage, resulting in its considerable complication and splitting into multiple substages with 
respective complex kinetic laws that are determined by the sequence. Universality of such kinetic laws is doubtful, 
but probably it can be recovered by averaging over certain classes of sequences with similar folding properties. 

Apparently, in the GSC method the kinetics is a motion in the space of N(N — I)/2 averaged dynamic variables, 
D nn ' (t), and it is determined by the profile of the free energy. It may be instructive, using Figs, [if] and [fj|, to present 
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the kinetics via a parametric plot of A vs R g , the latter being the main, though not the only, relevant "coordinate" 
of the system. That would produce a kind of "bottleneck" picture that was much discussed by P. Wolynes and others 
p6{ in relation to the protein folding problem. This indicates that our method produces behaviour that permits 
interpretation in terms of phenomenological energy landscape models. 

Finally, let us utilise once again the connection between the kinetic evolution of the free energy and the ruggedness 
of the potential energy landscape. This would allow us to shed some light on the general structure of the energy 
landscape for complex heteropolymers. Thus, a typical example of the "random" sequence kinetics (curve (D) in 
Fig. [II]) shows first a fast drop, then appearance of short steps that are growing longer with time, until the last step 
becomes infinite. This translates to the following equivalent energy landscape versus some collective coordinate: first 
there is a rapid drop away from the unstable coil state, then the surface flattens and small wrinkles appear on it, they 
are growing larger in the amplitude gradually becoming high mountains and deep valleys, until there is eventually 
a very deep ravine corresponding to the "ground" state of the system separated by a very high barrier from other 
minima. In our view, the latter picture bears a remarkable resemblance to a typical mean field energy landscape in 
spin glass systems [^4| recently discussed by G. Parisi p7] . This observation seems encouraging to us and indicates 
that the GSC method is capable of describing very complex systems, though it is too detailed and expensive in 
the complete form for description on a macroscopic scale. Nevertheless, by taking the quenched disorder averaging 
(similarly to Ref. E]) it is possible to achieve an adequate description for heteropolymers that is alternative to the 
replica formalism. We believe that the underlying connection between both types of approaches has been clarified to 
some extent by the current work. 



V. CONCLUSION 



In the present work we have extended the Gaussian self-consistent method to permit study of heteropolymers with 
complicated primary sequences. This has been achieved by transforming the kinetic GSC equations, earlier derived for 
the Fourier modes, to the form containing only the mean squared distances between pairs of monomers and relating 
these equations to the instantaneous gradients of the variational free energy calculated based on the Gibbs-Bogoliubov 
principle. The revised GSC formalism possesses important computational advantages, but also it is fundamentally 
superior to its predecessor in that it can describe phenomena of spontaneous symmetry breaking and formation of 
structural heterogeneities. 

We then have applied the extended GSC method to some particular amphiphilic heteropolymers. The equilibrium 
phase diagrams for these have been obtained in a systematic manner. Apart from the coil and two globular states we 
have discovered that in a wide intermediate region of the phase diagram there may be a large number of frustrated 
partially misfolded states. Despite the fact that such states, the number of which grows exponentially with the 
chain length, are mostly metastable, some of them become the dominant state in their rather narrow domains. The 
corresponding potential energy profile of the system has strong resemblance to that of a typical spin glass system. 
Thus, we may conclude that the transition to these states in the thermodynamic limit corresponds to a glassy freezing 
transition. 

We note that an observation that for sufficiently long sequences of alternating monomers ("short" blocks) the 
micro-phase separated state is displaced by the region of glassy frustrated phases has been known for quite some time 
and well understood in the framework of the density theories H as a result of the density fluctuations which modify 
the mean-field behaviour. The block translational symmetry breaking in our approach is another manifestation of the 
glassy phenomena earlier extensively studied for melts of random heteropolymers. In addition, we can see how the 
destruction of the micro-phase structures occurs for finite-sized systems and how exactly it depends on a particular 
sequence. 

The folding kinetics is found to be strongly affected by the presence of these transient frustrated states along the 
kinetic pathway. This leads to a complicated kinetic process consisting of multiple steps with pronounced slowing 
down and then acceleration in the folding rate. It is interesting to note that a typical fairly heterogeneous chain 
with weakly hydrophilic and strongly hydrophobic units folds kinetically first to one of the misfolded states and can 
undergo a consequent nucleation process to the main thermodynamic state of the micro-phase separated globule. 

These results for equilibrium and kinetics confirm many predictions of the earlier treatment adopted for random 
heteropolymers in Ref. Jl2| based on a quasi-perturbative averaging of the GSC equations over the quenched disorder. 
The latter approach describes a large set of random heteropolymer sequences with a Gaussian distribution of the 
monomer amphiphilicity. The kinetics of folding seems to be consistent between the previous and current approaches 
with respect to main observables such as the radius of gyration, energy and the MPS parameter. A smooth slow 
glass-like relaxation in the former approach is produced by averaging over all different sequences with their particular 
multistep processes of passage through the frustrated states. Although here we observe strong dependence of the phase 







boundaries in the equilibrium phase diagram on the primary sequence, the major conclusion of Refs. p2| , [l3[ that there 
are three distinct globular states of the homopolymer-like (liquid), the frozen (glassy) and the micro-phase separated 
(folded) globules, and an additional state of the random coil, as well their relative location in the diagram, remain 
valid. We have previously discussed that the equilibrium limit of the theory in the treatment of Rcf. |13| had certain 
deficiencies due to the additional perturbative approximation, which we attempted to remedy phcnomcnologically. 
The current fundamental scheme is free of any such problems and provides a reliable test ground for development of 
further approximations. 

We have also discovered that heteropolymer sequences of equal length can be roughly divided into distinct classes 
possessing similar phase diagrams and kinetic folding properties. For instance, periodic polymers with a few long blocks 
easily undergo micro-phase separation and for them the frustrated states are suppressed at equilibrium and in kinetics. 
There is a view in the scientific community that complex random sequences also permit more refined classification. The 
identification of good folding sequences (see e.g. Rcf. |pq| ) is believed to be an important prerequisite for unraveling 
the mysteries of proteins. 

In the current paper the GSC method has been applied to a single chain problem. However, since the kinetic 
equations (^) are covariant it would be relatively straightforward to apply it to solutions of many polymers. Formation 
of nontrivial mesoscopic globules in copolymer solutions at low concentrations has been predicted in the framework of 
the method |S] and recently observed experimentally (§(]] . Generally, the GSC method may be viewed as extension to 
the realm of kinetics of the variational treatment, which is a better theory than standard mean-field approaches due 
to a much larger number of variational parameters. We should admit, however, that in the complete form numerical 
solution of the GSC equations is computationally expensive for systems of a few hundred of monomers. Nevertheless, 
as we have recently shown p9[ , in some approximation the GSC equations can be reduced to those of a simple mean- 
field theory such as the Flory-Huggins one, and some extra corrections to the latter. Importantly, the formalism in 
terms of the mean-squared distances is valid for description of the extended coil as well as of the globular states, 
whereas the density formalism theories are limited to relatively high densities of the system. The weakest point of the 
method is the Gaussian form of the trial Hamiltonian, and thus of the correlation functions, a matter that has been 
discussed at some length in the past Pajy| . However, based on the covariant form of the kinetic equations derived 
here, we hope, in some not too distant future, to take into account the non-Gaussian corrections in an analogy to the 
treatment of the hard-core repulsion for van der Waals systems. 

The main strength of the revised GSC approach lies in the ability to describe a given heteropolymer sequence 
of finite length, rather than an infinite ensemble of sequences characterised in a certain probabilistic way. This is 
an inevitable preliminary step in developing adequate techniques for theoretical modelling of complex biopolymers. 
Understanding of the relation between the chemical composition (primary sequence) and the 3-d equilibrium (tetriary) 
structure, as well as of the kinetics of folding, in proteins is one of the great challenges of the modern biotechnological 
science. We hope that methods like the one we have presented here will take their right place in the collection of new 
tools for bioinformatics. 
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APPENDIX A: INCLUSION OF HYDRODYNAMICS 

The Langevin equation with account of the hydrodynamic interaction includes the Oseen tensor 

j t x%= Cm (ai) 

a' ,n' 

where 0" is the right-hand side of Eq. (Q) and the noise has the second momentum proportional to the inverse Oseen 
tensor. 

In the preaveraged approximation we have, 

Ittou \ _ S.ou p c _ 3mm' 1 ~ Smm' ( AO\ 

\ rl mra'l ~° Smm' j Um' — * T 1/2' *■ > 

Cb 3(2^)1/2^^, 

and the analogs of formulae (A5,A6) of Ref. will be, 
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2 dt 

m' 

r„„' = - 1 Bnm" "HIT" — = k B TS nn , + V" D n m" K'm" ■ (A4) 
3 *-f dD n , m n *—f 

Explicit expressions for a few first terms in the virial expansion of the effective potentials, 

V fA5) 

Vmm> — „ „ n (AO) 
O UJ-J mm > 



may be found in Appendix |c[ 

APPENDIX B: THE VARIATIONAL PRINCIPLE 

We introduce the trace as the integration over all monomers coordinates subject to the constraint that the reference 
point is fixed in the centre-of-mass of the chain, 

- iV-l 

Tr= / JJ dX„<5(^X„). (Bl) 

J n=0 n 

The n the partition function is obtained as Z to t = Trexp(— H/k B T) with H given by Eq. (H). The delta-function in 



(Bl) removes the trivial divergence in Z to t due to the translational invariance. 

bitrary linear combination, 

~Z ^ ^ k-mm'~K-m'K-m,' ~\~ ^ ] Jm^m; (-^2) 



We choose the trial Hamiltonian as an arbitrary linear combination, 

Ho 



k B T 

■mill in 

where we have also introduced arbitrary sources J m . Using the delta-function one can exclude, say, Xq and derive 



Z [J] = ^^^(deti^-^-^exp 



\ mm' / 



where the N — 1 dimensional matrix K is 

(K) nn , = K nn , + K 0Q - K n0 - K 0n > ■ (B4) 

We can calculate the averages by, 



T - 1 (XX \ l dHog Z a [J] _ - 



where again the latter identity holds only for n, n' 0. From these quantities it is straightforward to obtain the mean 
squared distances using Eq. (^) which also holds only for n,n' ^ 0. 

Finally, we want to express all independent parameters of the matrix K via the quantities D mm >. For this we 
compute the following sums applying the above delta-function constraint, 

^ ^ Dmm' 2./V ^ ^ J~ inmj ^ ^ D mn 27V ^ ; ^mm' ~\~ N J~ nn . (-^6) 

mm' m m mm' 

Substituting T nn from the second formula here to Eq. (|12|) and recalling the relation (g) we derive the desired inverse 



relation (12) for (K ')„„/ = T nn ' ■ 



From the partition function (B3) we obtain the "entropic" term Aq = — TS = —k B T logZo[0] that yields precisely 
Eq. (|j"T|). The Gibbs-Bogoliubov variational principle is then based on minimising this variational free energy 
A = Aq + (H — Hq)q with respect to the N(N — l)/2 independent variational parameters D mm i . 
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1. Entropy of the homopolymer 



For the homopolymer we have the translational invariance along the chain so that D nn i = Dk, where k = \n — n'\ 



Then the matrix (12) may be rewritten as, 



Rjin' — Rn ~ D nn ' , R 



b 2 ' 9 2N 

k 



^E^- ( B7 ) 



Let us apply a unitary transformation to the Fourier coordinates generated by the matrix, 

1 ( 2niqn\ ,„ . 

^-A^ eXP ("^J- (B8) 

By a direct evaluation and recalling the relation for the normal modes, 

-Wl^E Rc ^=^' (B9) 
k 

we see that the matrix R = /t • R • / is diagonal with the eigenvalues Rqq = and R qq — N!F q . Thus, the logarithm 

of determinant of the (N — l)-dimensional submatrix det' R yields the standard "entropy" of the homopolymer (see 
Eq. (3) in Ref. [jl7|) up to a trivial constant. 



APPENDIX C: FORMULAE FOR THE EFFECTIVE POTENTIALS 



Two first terms in the virial expansion of the derivatives (A.5) of the mean energy are explicitly given by. 



= (1- S A- 

v mm' ~ \ L u mm' ) 



-.(2) 



D 



5/2 



e=(i-MM w e 



E 

n^ra 

D 



-(2) 

" Hi II. 



D 



5/2 ' 



(detAW)^. 



E 



(detA(2))^„, 



(Cl) 
(C2) 



where (det A( 2 h i » — D , D , — D 2 

wucic L-i )mm'm n — ^mm' ■ LJ m' m' mm m m 

It is usually assumed that for negative the stability of the system is ensured by the positiveness of u^ 3 \ It 

(2) 

turns out however that for the whole range of u mm , this does not hold and there are additional pathological solutions 
with singular free energy. The reason for this deficiency of the model (||) is that we have discarded the terms with two 
coinciding indices in the three-body contribution. This standard trick is not satisfactory therefore, but fortunately it 
could be remedied by using another prescription for these terms: 



£3 = M (3) ^ (S(X m - X m ') S(X m " - Xm>) 



3u 



(3) 



E 

ra^ra' 



(C3) 



In Ref. [[31j we show that the addition of the latter term, which is subdominant in the large N limit anyway, does not 
change the earlier results for the homopolymer, and even quantitatively improves the agreement with known results 
for the dense globular state gj. Importantly, for heteropolymers this term removes spurious solutions and makes the 
theory well defined. The corresponding contributions to the mean energy and the effective potentials are, 



3fi< 3 > D 



-3 

rata' ' 



V^ = (1 - 5mm')6u^D, 



-4 
mm' 



(3) 



/ j ran 
n^ra 



(C4) 
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FIG. 1. The phase diagram of copolymer sequence 30(ab) ("short" blocks) in terms of the mean second virial coefficient, 
vP\ and the amphiphilicity, A (both in units ksTC 3 ). Curves (C) and (S) correspond respectively to the collapse and the 
micro-phase separation continuous transitions. Curves (I) and (II) correspond to discontinuous transitions to the frustrated 
phases. "Spinodal" curves (F) and (II") bound the regions of metastability of the frustrated states. Transition curves and 
boundaries distinguishing different frustrated sates are not depicted. 
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FIG. 2. The phase diagram of copolymer sequence lO(aaabbb) ("long" blocks) in terms of the mean second virial coefficient, 
vP\ and the amphiphilicity, A (both in units ksTC 3 ). For large values of A the collapse transition becomes discontinuous 
(curve (I)) and it is accompanied by micro-phase separation (see also Figs. M-ra). Curves (F) and (I") are spinodals. 



FIG. 3. The phase diagram of the "random" sequence 2(abacbbcabccbcaaacbcccbbaacbcca) in terms of the mean second 
virial coefficient, u^ 2 \ and the amphiphilicity, A (both in units ksTC 3 ). Other notations are as in Fig. 

FIG. 4. Diagrams of the mean squared distances matrix D mm i for the "short" blocks copolymer 30(ab) at A = 20. Diagrams 
(a-d) correspond respectively to w' 2 ' = 15, —21, —30 and —40 (in units fcsT£ 3 ). Indices m, m! start counting from the upper 
left corner. Each matrix element, D mm i is denoted by a quadratic cell with varying degree of black colour, the darkest and the 
lightest cells corresponding respectively to the smallest and to the largest mean squared distances. The diagonal elements are 
not painted since D mm = 0. For the coil (Fig. a) D mm i elements increase monotonically on moving away from the diagonal 
towards half-ring distance along the chain. In frustrated states (Figs. b,c) D mm i possesses some number of clusters with 
monomers having smaller distances between each other. For the MPS globule (Fig. d) D mm i reflects the structure of the 
two-body interaction matrix u , and consists of similar elementary cells. 

J mm J 

FIG. 5. Plot of the mean squared radius of gyration, R 2 , (in units C 2 ), vs the amphiphilicity, A (in units ksTC 3 ), for 
different sequences (from top to bottom): "long" blocks, "short" blocks and the "random" sequence. Here we have fixed 

FIG. 6. Plot of the parameter of micro-phase separation, <E', vs the amphiphilicity, A (in units ksTC ), for different 
sequences: "long" blocks (pluses), "short" blocks (diamonds) and the "random" sequence (quadrangles). Here we have fixed 

FIG. 7. Plot of the mean squared radius of gyration, R 2 (in units £ 2 ), vs the second virial coefficient, (in units fcsT£ 3 ), 
for lO(aaabbb) copolymer. Here and in Figs. p|-|lo| A = 30, solid lines correspond to values of observables in the main free 
energy minimum and dashed lines — in the metastable minima. 

FIG. 8. Plot of the parameter of micro-phase separation, ty, vs the second virial coefficient, (in units kBTC 3 ), for 
10(aaabbb) copolymer. 



FIG. 9. Plot of the mean squared radius of gyration, R 2 (in units £ 2 ), vs the second virial coefficient, tr 2 ' (in units ksTC 3 ), 
for the "random" sequence. 

FIG. 10. Plot of the parameter of micro-phase separation, vs the second virial coefficient, vS 2 ^ (in units fceT£ 3 ), for the 
"random" sequence. 

FIG. 11. Time evolution (t is in units T) of the mean squared radius of gyration, R 2 (in units £ 2 ), for different sequences 
after an instantaneous quench from the coil state, = 15 and A = 0, to the region with u^ 2 ' = —25 and A = 30. 

FIG. 12. Time evolution (t is in units T) of the parameter of micro-phase separation, $ ', for different sequences after the 
same quench as in Fig. |ll]. 



FIG. 13. Time evolution (t is in units T) of the instantaneous free energy, A (in units /cbT), for different sequences after 
the same quench as in Fig. O. 
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FIG. 14. Time evolution (t is in units T) of the mean squared distances between nearest 'a' monomers D2fc,2fc+2(i) (in units 
C 2 ) for 30(ab) copolymer in kinetics after the quench with the final two-body parameters: vP^ = —50 and A = 30. 

FIG. 15. Diagrams of D mm i (t) matrix for the "short" blocks copolymer 30(ab) in kinetics after the same quench as in 
Fig. jl4| Diagrams (a-c) correspond respectively to the following moments in time: t = 4.6, 11, and 12.9. See also caption 
to Fig. ^ for more details. The kinetic process proceeds through formation of locally collapsed and phase-separated clusters. 
The initial conformation is similar to Fig. Ma, then some clusters appear, coalesce into larger ones, until they eventually unify 
forming the MPS globule (similar to Fig. |4|d) . 
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